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ABSTRACT 


Let  (X1#  . ..,  X^)  be  a multinomial  vector  with  unknown 
cell  probabilities  (p^,  p^) . A subset  of  the  cells  is 

to  be  selected  in  a way  so  that  the  cell  associated  with 
the  smallest  cell  probability  is  included  in  the  selected 
subset  with  a preassigned  probability,  P*.  Suppose  the  loss 
is  measured  by  the  size  of  the  selected  subset,  S.  Using 
linear  programming  techniques,  selection  rules  can  be  constructed 
which  are  minimax  with  respect  to  S in  the  class  of  rules  which 
satisfy  the  P*-condition.  In  some  situations,  the  rule 
constructed  by  this  method  is  the  rule  proposed  by  Nagel 
(1970) . Similar  techniques  also  work  for  selection  in  terms 
of  the  largest  cell  probability.  The  rules  constructed  in  this 
fashion  are  also  minimax  for  selection  in  terms  of  Poisson 
parameters. 
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1.  INTRODUCTION 


In  this  paper,  subset  selection  problems  for  the  multi- 
nomial distribution  are  considered.  In  these  problems  the  aim 
is  to  select  a non-empty  subset  of  the  cells  which  contains 
the  cell  with  the  highest  or  lowest  cell  probability.  The 
goal  is  to  find  a selection  rule  which  includes  the  highest 
or  lowest  cell  probability  with  probability  at  least  equal  to 
a preassigned  number,  P*.  Having  satisfied  this  minimum 
requirement,  the  goal  is  to  find  a rule  which  effectively 
excludes  non-best  cells.  This  leads  to  the  use  of  the  number 
of  cells  selected  or  the  number  of  non-best  cells  selected  as 
a measure  of  the  loss  to  the  e>:vcrii  enter.  Minimax  rules  for 
these  losses  are  considered  and  the  main  result  of  this  paper 
is  that  minimax  rules  can  be  constructed  by  solving  the 
appropriate  linear  programming  problem.  The  rule  obtained 
in  this  way  in  some  situations  corresponds  to  a particularly 
simple  and  easy  to  implement  rule  proposed  and  studied  by 
Nagel  (1970) . This  rule  can  also  be  shown  to  have  another 
optimal  property  if  the  cell  probabilities  are  in  a slippage 
configuration. 

The  subset  selection  problem  for  multinomial  distributions 
has  been  previously  considered  by  Gupta  and  Nagel  (1967)  , Nagel 
(1970),  Tanchapakesnn  (’.971)  ard  Gu:  ta  and  Huang  (1S75)  . 

Kiniriax  subset  selection  rules  hav*.  been  recently  investigated 
in  a reneral  setting  by  Berger  (1979) . 
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Section  2 contains  the  necessary  notation  for  a formulation 
of  the  problem.  In  Section  3,  the  problem  of  choosing  the 
smallest  cell  probability  is  considered  and  the  fact  that  mini- 
max  selection  rules  can  be  constructed  using  linear  programming 
methods  is  proven.  Examples  of  rules  constructed  in  this  way 
are  given.  A particular  rule  which  arises  as  the  solution  to 
the  problem  is  further  considered  in  Section  4.  This  rule  is 
found  to  have  a certain  optimality  property  if  the  parameters 
are  in  a slippage  configuration.  In  Section  5,  the  analogous 
results  for  the  problem  of  selecting  the  largest  cell  proba- 
bility are  outlined.  In  Section  6,  the  fact  that  the  rules 
constructed  in  Sections  3 and  5 are  also  minimax  for  selection 
in  terms  of  Poisson  parameters  is  explained. 


2.  NOTATION  AND  FORMULATION 


Let  X ■ (X, , . . . , X,  ) be  a multinomial  random  vector  with 
k ~ 1 k 

[ X * N.  Let  p = (p,  , . . . , p ) be  the  unknown  cell 
i-1  1 k 1 

probabilities  with  [ p = 1.  The  ordered  c'-'ll  probabilities 

i=l  1 

will  be  denoted  by  pM1  s ...  s p , . The  goal  of  the  experi- 

Ll]  _ LkJ 

menter  is  to  select  a subset  of  the  cells  including  the  best 
cell,  the  cell  associated  with  ^ . A correct  selection,  CS, 
is  the  selection  of  any  subset  which  contains  the  best  cell. 

If  for  a particular  parameter  value,  p,  more  than  one  of  the 
cells  is  tied  with  thes  Hint  P^»  one  of  these  cells  will  be 
considered  to  be  tagged  and  a CS  occurs  if  the  tagged  cell  is 
selected.  This  assumption  is  not  of  essential  importance  and 
can  be  dropped  without  effecting  any  of  the  results.  But  t 
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«i  1 i'  uvl.e  ar  >:•  ents  involvinglimits  of  parameter  values. 

A selection  rule  will  be  denoted  by  * (x)  - U,(x),  ...»  (x) ) 

where  ♦^(x)  is  the  probability  of  including  the  ith  cell  in 
the  selected  subset  having  observed  X - x.  The  (x)  are  called 
the  individual  selection  probabilities.  To  insure  that  a non- 
empty subset  is  always  selected,  only  rules  which  satisfy 
k 

l 6 (x)  i 1 for  all  x will  be  considered.  The  first  requirement 
i-1  1 

a selection  rule  must  satisfy  is  that  it  has  a certain 
minimum  probability  of  selecting  the  best  cell.  Let 
P*,  1/k  < P*  < 1,  be  a preassigned  fixed  number.  Only  rules 
which  satisfy  the  P*- condition , viz., 

(2.1)  inf  P (CSU)  2 F*, 

P ~ 

will  be  considered.  The  set  of  rules  which  satisfy  the  P*- 
condition  will  be  denoted  by  . 

l 

Two  losses  which  are  commonly  used  in  subset  selection 

problems  are  the  size  of  the  selected  subset,  S,  and  the 

number  of  non-best  cells  selected,  S’.  The  risk  using  a 

rule  i when  p is  the  true  parameter  is  then  the  expected 

subset  size,  E (S|$),  or  the  expected  number  of  non-best  cells 
P ~ 

selected,  E (S’|$).  Let 
P 

k 

(2.2)  Six)  ■ J (x) . 

~ i-i  1 ~ 

Then  S(x)  is  the  expected  subset  size  having  observed 


X - x and  E (S|$)  - Ep(S(X)|$). 
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A selection  rule,  ♦ *,  is  said  to  be  minimax  with  respect  to 
S if  j*  ( and 

(2.3)  inf  sup  ( S | ♦ '/  - sup  E (S|$*). 

V p - ' K E * 

Replace  S by  S'  in  (2.3)  to  define  minimaxity  with  respect  to 
S'.  Berger  (1979)  investigated  minimaxity  with  respect  to  S 
and  S'  in  a general  setting.  In  Section  3,  selection  rules 
which  are  minimax  with  respect  to  S and  S'  will  be  constructed. 

3.  CONSTRUCTION  OF  MINIMAX  RULES 

In  Theorem  3.1,  the  following  class  of  selection  rules  will 
be  shown  to  be  minimax  with  respect  to  S and  S'.  Let  0*  be 
tne  class  of  selection  rules  which  satisfy:  (i)  4^(x)  - fi^x^) 

for  1 s i s k,  i.e.,  the  selection  or  rejection  of  the  i^*1  cell 
depends  only  on  the  number  of  observations  in  the  ifc^  cell; 

(ii)  f^(m)  * 6^(n)  ifOsmsnsN  and  1 s i s k,  i.e.,  the 

probability  of  selecting  the  ifc^  cell  decreases  as  the  number 

of  observations  in  the  i*-*1  cell  increases; 
k 

(iii)  S(x)  * £ fl.(x^)  is  a Schur  concave  function  of  x 

i“  1 

(iv)  E (6. (X.))  * P*  for  1 s i s k where  p.  ■ (1/k,  ...»  1/k) . 

E0  1 1 

Remark  3.1.  The  requirement  in  (i)  that  the  decision  to  include 

or  exclude  the  i cell  depends  only  on  may  seem  to  be  a 

waste  of  the  information  contained  in  the  other  X,  's.  But 
k J 

since  J X.  » N,  if  X.  is  "larqe"  then  the  other  X.'s  must  be 
i-1  x 1 J 


m 
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small  and  if  X is  "small”,  some  of  the  other  X.'s  must  be 
i D 

large.  So  information  about  the  other  X.'s  is  contained  in  X. 

J i 

and  the  requirement  is  not  as  counterintuitive  as  it  would  be 
in  a problem  in  which  the  Xj's  are  independent,  e.g.,  the 
normal  means  problem  (see  Gupta  (1965)).  The  rules  considered 
by  Nagel  (1970)  had  property  (i)  as  well  as  (ii)  and  (iv) . 


Remark  3.2.  Condition  (ii)  seems  reasonable  since  larger 
values  of  X^^  indicate  larger  values  of  p^. 

Remark  3.3.  See  Proschan  and  Sethuraman  (1977)  and  Nevius, 
Proschan  and  Sethuraman  (1977)  for  a discussion  of  Schur 
functions  in  statistics.  In  a sense,  x majorizes  ^ if  the 
coordinates  of  x are  more  spread  out  than  the  coordinates  of  y. 
The  more  spread  out  the  cell  frequencies  are,  the  easier  it 
should  be  to  decide  which  is  the  best  cell  and  the  smaller 
the  subset  that  need  be  selected.  This  is  an  interpretation  of 
condition  (iii) . 


Remark  3.4.  By  Theorem  4.1  of  Berger  (1979),  this  is  a necessary 
condition  if  a rule  is  to  be  minimax  with  respect  to  S or  S'. 


Remark  3.5.  Gupta  and  Nagel  (1967)  proposed  and  studied  the 
following  selection  rule  for  this  problem:  select  the  ifc^ 
cell  if  x^  s xmin  + c where  xm^n  - minfx^,  ...,  x^)  and  C is  a 
non-negative  integer  chosen  so  that  the  P*-condition  is  satisfied. 
In  general,  this  rule  is  not  in  V*  since  neither  (i)  nor  (iii) 


^ 


are  true.  In  the  special  case  of  k = 2,  however,  it  can  be 
verified  that  the  Gupta-Nagel  rule  is  in  V*  since  x0  = N - X^. 
Of  course,  to  satisfy  (iv)  exactly,  this  randomized  version  of 
the  Gupta-Nagel  rule  must  be  used: 


(x) 
1 ^ 


1 

1 <i 

.0 


xi  < xmin  + C 

Xi  * ’Snin  + C 
x.  > x . + C 


So,  by  the  following  theorem,  the  Gupta-Nagel  rule  is  minimax 
if  k - 2. 


Theorem  3.1.  If  # « V*  then  $ is  minimax  with  respect  to  S 
and  S ' . 


Proof . The  first  fact  to  be  verified  is  that  if  $ e V*  then 


♦ satisfies  the  P*-condition . Let  P.  = {p:  p.  ip.  i 0 for  all 
~ 3 i 

j * i and  \ p.  = 1}.  Then  inf  P-,  (CSjg)  = inf  inf  P (select 

j-1  J P - lsisk  Pi  p 

i^  cell  |*)  - inf  inf  EL(<J>.  (X) ) . Let  Y have  a binomial 
lsisk  PL  ~ 1 ~ 

distribution  with  parameters  N and  p^^ . Then  E (<^(3))  = 


Since  the  binomial  distribution  has  monotone 


(6. (Y) ) 
pi  1 

likelihood  ratio  and  since  6^  is  a 'ia  : /.easing  function, 


by  Lemma  2,  page  74  of  Lehmann  (1959),  E (6,( Y) ) is  nonin- 

pi  1 

creasing  in  p. . So  the  inf  &,(*. (X))  = inf  E (6.(Y))  = 

1 P ~ 1 P.  pi  1 


P. 

1 


2 .,.(ii(Y))  - P*.  Thus  inf  P (CS|$)  = inf  P*  * P*  and  * 

pi  1/k  1 P p lsisk 


uatiufica  the  P*-condition.  ~ 
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Now  (2.3)  will  be  verified.  By  Theorem  3.1  of  Berger 
(1979),  the  minimax  value  is  so  it  suffices  to  show  that 

sup  E (S|$)  = ,;p.<  for  any  $ < P*.  But,  since  S(x)  is 

P E. 

Schur  concave,  by  Application  4.2(a)  of  Nevius,  Proschan  and 

Sethuraman  (1977)  El^-,  (S  (X)  | ^)  is  Schur  concave  in  p and  hence 

takes  on  the  maximum  value  at  p = (1/k,  ...,  1/k) . 
k k~° 

E (S(X)|$)  = l E.  *.(X)  - [ E..  ( 5 . (X . ) ) = kp*  by  (iv)  . 

?0  ~ ~ i=i  ~o  i=l  ~0  1 1 

So  (2.3)  is  verified. 

Finally,  by  Theorem  3.2  of  Berger  (1979),  if  $ is 
minimax  with  respect  to  S then  <£  is  also  minimax  with  respect 
to  S'.  || 

Other  authors  have  provided  bounds  on  E(S)  for  the  rules 
they  have  proposed.  For  example,  Gupta  and  Huang  (1975) 
give  an  upper  bound  for  E(S)  when  the  parameters  are  in  a 
slippage  configuration.  But  the  result  of  Theorem  3.1  is 
stronger  in  that  minimaxity  considers  all  parameter  configura- 
tions and  the  exact  upper  bound  of  kF*  for  E(S)  is  achieved. 

D*  is  a wide  class  of  selection  rules  which  are  minimax. 
Finding  one  rule  in  V * which  has  an  additional  optimality 
property  may  be  accomplished  by  solving  the  following  linear 
programming  problem.  Consider  6 = (6^(0),  ...,  6^ (N) , 

6^(0) , ...,  60(N),  ...,  5k(0),  «k(N))  as  the  solution 

vector  for  which  we  wish  to  solve.  Condition  (ii)  provides 
kN  linear  constraints  on  the  solution.  Condition  (iii) 
provides  additional  linear  constraints  on  the  solution.  For  ex- 
ample, since  (6,  2,  0)  majorizes  (5,  3,  1),  we  must  have 


«x  (6)  + 6 2 ( 2 ) + 6 3 (0)  s 6 x ( 5)  + 69(3)  + 63(1).  Finally, 
condition  (iv)  provides  k additional  linear  constraints  on 
the  solution.  Subject  to  these  constraints  we  wish  to  minimize 
Eg'<sl£)'  a linear  function  of  the  coordinates  of  6,  for 
some  p*  * Pq.  The  parameter  p'  could  be  some  particular 
parameter  value  in  which  the  experimenter  is  particularly 
interested.  As  an  example,  this  problem  was  solved  for 
P*  = . 4 ( . 1 ) .9  and  g'  * (.1,  .45,  . 1j)  and  g'  = (.2,  .4,  .4) 

in  the  case  where  k = 3 and  N - 6.  The  resulting  minimax 
rules  are  shown  in  Table  1.  In  finding  these  solutions,  the 
additional  constraint  was  made  that  the  solution  was  permuta- 
tion invariant,  i.e.,  fi^m)  = 6 ^ (m)  for  all  1 s i,  j s k 
and  all  0 s m s N.  These  solutions  were  obtained  using  the 
NYBLPC  computer  program.  This  program  uses  the  criss-cross 


method  as  developed  by  Dr.  Stanley  Zionts.  The  computations 
were  done  on  the  CDC  CYBER  74  at  the  Florida  State  University 
Computing  Center. 


4,  A SIMPLE  RULE 


An  examination  of  Table  1 reveals  that,  in  all  cases  when 
P*  is  large,  the  optimal  rule  obtained  by  solving  the  linear 
programming  problem,  as  outlined  in  Section  3,  has  the 


following  simple  form; 


1 x.  < t 
1 


(4.1) 


>?  (x)  « 6*(x  . ) = J a x . 
1 ~ 1 1 


0 xt  > t 
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Here  t and  a are  chosen  so  that  condition  (iv)  (Section  3)  is 
satisfied.  t'anely,  let  Y have  a binomial  distribution  with 
parameters  1/k  and  N.  Then  t is  the  integer  which  satisfies 
P (Y  < t)  i P*  < P(Y  < t)  and  ct  = (P*  - P (Y  < t))/P(v  = t)  . 

This  rule  is  analogous  to  a rule  proposed  by  Nagel  (1970)  for 
selecting  the  largest  cell  probability.  This  rule  is  appealing 
for  its  simplicity  and  because  the  constants  t and  a,  needed 
for  implementation,  can  be  easily  obtained  from  a binomial 
table.  In  this  section  the  rule  is  studied  more  closely. 
First,  in  Theorem  4.1,  is  shown  to  be  minimax  if  P*  is 
sufficiently  large.  Then,  in  Lemmas  4.1  and  4.2  and  Theorem 
4.2,  is  shown  to  have  the  following  optimality  property  if 
the  parameters  are  in  a slippage  configuration.  Suppose  all 
the  cell  probabilities  are  equal  except  for  the  ifc 
minimizes  the  probability  of  selecting  the  ifc^  cell  if  its 
cell  probability  is  larger  than  all  the  rest  and  maximizes  the 
probability  of  selecting  the  ith  cell  if  its  cell  probability 
is  smaller  than  all  the  rest  among  all  minimax  rules. 

Theorem  4.1.  Let  Y have  a binomial  distribution  with  parameters 
1/k  and  N.  Then,  if 

(4.2)  P*  > P(Y  < |)  + * p(Y  = N) f 

<t>*  is  minimax  with  respect  to  S and  S'. 

Proof . By  Theorem  3.1  it  suffices  to  show  e V* . Conditions 
(i) , (ii)  and  (iv)  are  obviously  satisfied  by  the  definition  of 


1 

i 
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<t*.  It  remains  to  show  that  (iii)  is  true,  i.e.,  S*  (x)  = 
k 

J 6* (x . ) is  a Scnur  concave  function  of  x. 

1=1 

To  see  that  S*  is  Schur  concave,  suppose  x majorizes  y 
k k ~ 

where  \ y = N = \ ’ . By  the  result  of  Hardy,  Littlewood 

4=1  1 4=1  1 

and  Polya  (1952,.  page  47),  we  may  assume,  without  loss  of 
generality,  chat  x and  j'  differ  in  two  coordinates  only,  say, 
x-  > y.  £ y.  > x.  where  x;  + x.  = y.  + y..  Since  the  constants 
t and  a are  chosen  so  that  P (Y  < t)  + a P(Y  = t)  = P*,  (4.2) 
implies  that  either  t > - or  t = ^ and  a £ i.  Since  all  of 

C.  £ * 

11 

the  coordinates  of  x and  % are  equal,  except  the  i1"  and  j , 

S*(y)  - S*(x)  = 6*(y.)  + 6*(y.)  - 6*(x.)  - 6*(x.).  Since 
v-  ~ ~ i J 1 J 

r N 

) y = N and  y £ 0,  1 < 4 s k,  y.  s — s t.  If  y.  <t  then 
4=1  1 1 3 2 3 

6*  (y  . ) = 6*(x.)  = 1 and  6*  (y . ) s 5*(x.)  so  S*(y)  - S*  (x)  z 0. 

7 3 h*  a **  u 

If  t = y_.  then  t = ~ ■ = Yj  = y^  and  S*(]£)  - S*(x)  = a + a-  0-  la0 

since  u i i if  t = j.  Thus  S*  is  Schur  concave.  | | 

Values  of  the  lower  bound  given  in  (4.2)  are  tabled  in 
Table  2 for  k = 2(1)  10  and  N = 1(1)  20.  Table  2 reveals  that, 
for  small  values  of  k and  N,  tl is  lower  bound  is  reasonably 
small.  But  as  k or  N increases,  the  lower  bound  converges  to 
one.  So  Theorem  4.1  shows  that  is  minimax  only  for  moderate 
values  of  k and  N.  It  can  be  shown  that  if  N is  an  odd 

number  then  the  expression  given  in  (4.2)  is  the  same  for  both 

N and  N + 1.  That  is  why  one  column  suffices  for  each 
consecutive  odd-even  pair  in  Table  2. 

Now  the  behavior  of  when  the  parameter  is  in  a 
slippage  configuration  will  be  examined.  For  the  remainder  of 


11 


L 


this  section  let  g'  - (p,  . ..,  p,  q,  p,  . ..,  p)  where  (k  - l)p  + q ■ 1 


ana  q is  the  it  coordinate.  The  following  result  will  be 
proven. 


Theorem  4.2.  Suppose  is  minimax.  Then  among  all  minimax 


.th 


rules,  maximizes  P t (select  i 

~ ~ P 

minimizes  P (select  ifcl1  cell  l^)  if  q > p. 
~ P 


cell  | $)  if  q < p and 


The  proof  of  Theorem  4.2  will  be  accomplished  via  the 
following  two  lemmas. 


Lemma  4.1.  Suppose  q < p.  Among  all  rules,  which  satisfy 


P (select  ith  cell  U)  s P*,  maximizes  P_, (select  ith 
P ~ ~ P 


cell  |+). 


Proof.  For  any  p and  $,  P_(select  itl^  cell  |<J>)  * E ♦.(X). 

_ IV  IV  ^ M p 2.  **** 

By  the  Neyman-Pearson  Lemma,  among  all  rules  which  satisfy 
E (X)  s P*,  a rule  which  maximizes  E , <f> . (X)  is  given  by 

Eq  1 ~ E 1 ~ 


(4.3) 


♦ i (x) 


(N) 

X 


*i  N-x  i 
I P 


(?)  k“N 


> t 


where  t and  a are  chosen  so  that  E . (X) 

Eq  1 ~ 


P*.  Using  the 

fact  that  q < p,  it  can  be  seen  that  (4.3)  is  equivalent  to 
(4.1).  || 
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Lemma  4.2.  Let  q > p.  Among  all  rules,  4>  r which  satisfy 
P (select  cell  U)  z P*,  41*  minimizes  P (select  ith 

E0  P 

cell  | $) . 

Proof . The  proof  follows  the  same  lines  as  that  of  Lemma  4.1.  | | 

Proof  of  Theorem  4.2.  By  Theorem  4.1  of  Berger  (1979), 
every  rule,  $,  which  is  minimax  with  respect  to  S or  S'  must 
satisfy  P.,  (select  ith  cell  I*)  = P*.  So  Theorem  4.2  follows 
from  Lemmas  4 . 1 and  4.2.  | | 

Seal  (1955,  Section  4)  examines  a slippage  problem, 
analogous  to  the  one  considered  in  Theorem  4.2,  for  the 
normal  means  selection  problem.  He  shows,  "approximately," 
that  a certain  rule  has  the  property  that  has  in  the 
multinomial  problem.  "Approximately"  is  Seal's  term.  It 
refers  to  the  fact  that  he  used  an  asymptotic  argument.  But 
his  result  can  be  proved  exactly  using  the  ifo/aian-Pearson 
Lemma  as  in  Lemmas  4 . 1 and  4.2. 

5.  SELECTION  IN  TERMS  OF  THE 
LARGEST  CELL  PROBABILITY. 

In  some  problems  the  experimenter  might  be  interested 
in  selecting  a subset  of  the  cells  including  the  cell 

' 

associated  with  p , the  largest  cell  probability.  In  this 

L K J 

T 

problem  a correct  selection,  CS,  is  the  selection  of  any  subset 
including  the  cell  associated  with  In  this  section, 

' ' 

results  analogous  to  those  found  in  Sections  3 and  4 will  be 
briefly  outlined. 

(j 
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The  following  class  of  selection  rules  can  be  3hown  to 

be  minimax  with  respect  to  S and  S'.  Let  V * be  the  class  of 

selection  rules  which  satisfy  (i),  (ii‘),  (iii)  and  (iv) 

where  (i)  , (iii)  and  (iv)  are  as  in  Section  3 and  ( i i ‘ ) 

6^(m)  s 6^(n)  ifOsmsnsN  and  1 s i s k,  i.e.,  the 

fcbi 

probability  of  selecting  the  i cell  increases  as  the  number 
of  observations  in  the  i1"^1  cell  increases. 

Theorem  5.1.  If  | c P*  then  £ is  minimax  with  respect  to  S 
and  S ' . 

Proof . This  proof  follows  the  lines  of  the  proof  of  Theorem 

3.1  where  now  (ii')  is  used  to  show  that  if  <£  c D*  then  $ 

satisfies  the  P*-condition . i 

Rules  in  V * can  be  constructed  by  solving  a linear 

programming  problem.  The  condition  that  £ c 0*  puts  linear 

constraints  on  the  vector  of  selection  probabilities 

6 = (6,(0),  . . . , 6,  (N) ) . The  6 which  minimizes  E (S I 6)  for 
~ l k ~ B’ 

some  g'  * Pg , subject  to  the  constraint  6 c P*  can  be  solved  for. 

This  was  done  for  P*  = . 4(..  1)„9  and  jo’  = (.8,  .1,  .1)  and 

p"  = (.6,  .2,.  .2)  in  the  case  where  k = 3 and  N = 6.  The 
~ 

same  rule  was  found  to  be  optimal  for  both  p'  and  p".  The 
resulting  minimax  rule  is  presented  in  Table  3. 

A particularly  simple  rule,  which  arose  as  the  solution 


to  the  linear  programming  problem  when  P*  was  large,  is  the 
following : 
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(5.1) 


1 

if 

xi 

> t 

(X)  = 6* (xi)  * - 

a 

if 

x . 

l 

= t 

.0 

if 

xi 

< t 

where  a and  t are  chosen  so  that  condition  (iv)  is  satisfied. 

is  the  rule  proposed  by  Nagel  (1970).  like  in 

Section  4,  is  minimax  if  P*  is  large  and  performs  well  in 
the  slippage  configuration.  Specifically,  the  following 
two  theorems  are  true. 


Theorem  5.1.  Let  Y have  a binomial  distribution  with  parameters 
1/k  and  N.  Then,  if 

(5.2)  P*  5 P (Y  > 1)  + j P (Y  = 1), 

<f>*  is  minimax  with  respect  to  S and  S'. 

Proof.  If  (5.2)  is  true  then  e V. . The  key  point  to 

k 

verify  is  that  S.  (x)  = V <5 . (x  ) is  Schur  concave.  This  can  be 

i=l  * i 

verified  using  the  fact  that  (5.2)  implies  either  t = 1 and 
a 5 i or  t *=  0 || 

For  fixed  values  of  N and  P*,  the  lower  bound  given  in 

(5.2)  is  a decreasing  function  of  k.  So  for  fixed  N and  P*, 
will  be  minimax  if  k is  sufficiently  large.  The  value  of 
the  lower  bound  given  in  (5.2)  is  tabulated  for  N = 1 (1)  20 

and  k » 2 (1)  10  in  Table  4. 

Finally,  performs  well  if  the  parameters  are  in  a 
slippage  configuration.  Let  p'  = (p,  ...,  p,  q,  p,  ...,  p) 

where  (k-l)p+q=l  and  q is  the  ifc^  coordinate.  The 


1 


•I 


following  result,  analogous  to  Theorem  4.2,  is  true. 
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Theorem  5.2.  Suppose  £ is  minimax.  Then  among  all  minimax 
rules,  maximizes  P , (select  1 cell  |$)  if  q > p and 

'v  1.  U 

minimizes  P, ( (select  i cell  |^)  if  q < p. 

The  proof  of  Theorem  5.2  is  completely  analogous  to  the 
proof  of  Theorem  4 2 and  is  omitted. 

6.  SELECTION  RULES  FOR  POISSON  PARAMETERS. 


The  selection  rules  constructed  in  Sections  3 and  5 can 

also  be  used  for  selection  problems  involving  Poisson  parameters. 

These  rules  will  also  be  minimax  for  these  Poisson  selection 

problems.  Selection  rules  for  the  Poisson  distribution  have 

previously  been  studied  by  Gupta  and  Nagel  (1971)  and  Gupta 

and  Huanq  (1975) . Goel  (197 2)  showed  that  the  usual  location 

and  scale  type  selection  rules  do  not  satisfy  the  P*-condition 

in  this  problem  for  large  values  of  P*. 

Specifically  let  V = (V  V ) be  independent  Poisson 

random  variables  with  parameters  X = (X,,  ...,  X.). 

J.  1 k 

Let  N * \ Y . The  goal  is  to  select  a subset  of  the 

i=l  1 

population  including  the  one  associated  with  the  largest  or 

smallest  parameter  X . . The  conditional  distribution  of  Y 

l ~ 

given  N * N is  multinomial  with  parameters  p and  N where 
k 

p.  * 1.  So  if  W = N,  we  can  use  the  rule  constructed 

1 j“l  3 

in  Section  3 or  5 for  a sample  of  size  N.  This  rule  lias  the 

property  that  P (CS | N « N)  s P*  for  0 - N < - and  all  X.  So 
A ~ 

unconditionally,  P (CS)  5 P*  for  all  X,  using  this  rule,  i.e., 

L ~ 

the  rule  r.ati  sf  ies  the  P*-condition . Similarly  E^  (S)  s krr 


I 
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for  all  X since  E (S  | W =»  N)  s kP*  for  0 <;  N < * and  all  X,  i.e., 

the  rule  is  minimax  with  respect  to  S.  M is  a complete 

sufficient  statistic  for  X on  the  set  il  = { X : X = ...  = x,  }. 

~ 0 ~ 1 k 

By  Lemma  1.1  of  Gupta  and  Nagel  (1971)  if  P,(CS|<j,)  r P*  on  fln, 
then  PX(CS|N  = N)  = P*  for  0 <;  K < But  P (CS|$)  = P*  on  is 

a necessary  condition  for  minimaxity  by  Theorem  4.1  of  Berger 
(1979) . So  considering  rules  conditional  on  W is  natural  in 
this  problem. 
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Table  2 


Lower  bound  for  P* 

If  P*  2 table  entry,  is  minimax. 


N 

1.2 

3,4 

5,6 

7,8 

9,10 

11,12 

13,14 

15,16 

17,18 

19,20 

2 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

3 

.6667 

.7407 

.7901 

.8267 

.8552 

.8779 

.8965 

.9118 

.9245 

.9352 

4 

.7500 

.8438 

.8965 

.9294 

.9511 

.9657 

.9757 

.9827 

.9876 

.9911 

5 

.8000 

.8960 

.9421 

.9667 

.9804 

.9883 

.9930 

.9958 

.9974 

.9984 

6 

.8333 

.9259 

.9645 

.9324 

.9910 

.9954 

.9976 

.9937 

.9993 

.9996 

7 

.8571 

.9446 

.9767 

. 9 >98 

.9955 

.9979 

.9991 

.9996 

.9998 

.9999 

8 

.8750 

.9570 

.9839 

.9938 

.9975 

.9990 

.9996 

.9998 

.9999 

1 . 0000 

9 

.8889 

.9657 

.9885 

.9960 

.9986 

.9995 

.9998 

.9999 

1 . 0000 

1.0000 

10 

.9000 

.9720 

.9914 

.9973 

.9991 

.9997 

.9999 

1 . 0000 

1.0000 

1.0000 

Table  3 


Kinimax  selection  rule  tor  p , , which 

* 1 k J 

minimizes  F.^, (S)  and  (S) 


= ( - 13 , -1/  -1)  i>  3 (-6,  . 2 , .2)  N=>C  k = 
6(x)  = P (select  ita  cell  | * x) 


.9 

. 8 

.7 

.6 

. 5 

. 4 

0 

.000 

.000 

.000 

.000 

.000 

.000 

1 

.954 

.574 

.448 

. 384 

. 320 

.253 

2 

1.000 

1.000 

.897 

.768 

.639 

.507 

3 

1.000 

1.000 

.897 

. 768 

.639 

.507 

4 

1.000 

1.000 

. 897 

.768 

.639 

.507 

5 

1.000 

1.000 

. 897 

. 768 

.680 

. 747 

6 

1.000 

1.  oco 

1.000 

1.000 

1.000 

1.000 
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Table  4 

Lower  bound  for  P* 


If 

3 

P*  i 

4 

table 

5 

entry , 

6 

£*  is 

7 

mini max 

8 9 

10 

.2500 

. 1 6t*7 

. 1250 

. 1000 

.0833 

.0714 

.0625 

.0556 

. 0500 

.5000 

. 3333 

.2500 

. 2000 

. 1667 

. 1429 

.1250 

.1111 

. 1000 

.t>875 

.4815 

. 36  ”2 

. 2960 

.2477 

.2128 

. 1 865 

. 1660 

. 1495 

.8125 

.0049 

.4727 

.3856 

. 3248 

. 2803 

.2463 

. 2 1 9(> 

.1981 

. 8906 

. 703” 

. 5649 

.4675 

. 3972 

.3146 

.3039 

.2717 

.2455 

.0375 

.7805 

.6440 

.5112 

. 1642 

.4051 

. 3589 

.3218 

.2914 

.9c>48 

. 8390 

.7108 

. 0O08 

.5256 

.4618 

.4110 

. 3697 

.3357 

.9805 

.8829 

. 7 064 

. 6645 

.5814 

.5144 

. 4600 

.4154 

.3782 

.9893 

.9155 

.8123 

.7148 

. 63 1 8 

.5630 

.5061 

.4587 

.4189 

.9941 

. 9393 

. 8498 

. 7584 

.6770 

.6076 

.5490 

.4996 

.4576 

. 99t>8 

. 9560 

.8803 

. 7960 

.7174 

. 6483 

.5889 

.5381 

.4944 

.9983 

. 9o92 

. 9050 

.82  » 2 

. 7533 

. 6855 

. 6259 

.5742 

.5293 

. 999 1 

. 9"S2 

.9248 

.8557 

.7850 

.7192 

. 6601 

. 6080 

.5622 

. 9995 

.9840 

. 94  lo 

.8791 

.8131 

."'197 

. 69 1 6 

.6395 

. 5933 

.9997 

.9892 

.9532 

. 8988 

. 8377 

.7772 

.7205 

. 6t>89 

.6225 

. 9999 

.9924 

.96.33 

.9156 

.8594 

.8019 

. 74  70 

. 6962 

.6500 

. 9999 

. 994  7 

.9712 

.9296 

.8783 

.8242 

. 771 2 

.7215 

. 6757 

1 . 0000 

. 9963 

.9774 

.9415 

.8948 

.8441 

. 7934 

.7450 

. 6998 

l . 0000 

.9974 

.9824 

.9514 

.9092 

. 8619 

.8136 

. 7666 

. 7223 

1 . 0000 

. 998  2 

.9863 

. 9596 

.9217 

.87?8 

. 8319 

. 7866 

.7433 
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class  of  rules  which  satisfy  the  P* -condition.  In  some  situations,  the  rule  constructed  by 
this  method  is  the  rule  proposed  by  Nagel  (1970)  Similar  technivjues  also  work  for  selection 
in  terms  of  the  largest  cell  probability.  The  rules  constructed  in  this  fashion  are  also 
minim  ix  for  selection  in  terns  of  Poisson  parameters. 
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